Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Comput Struct Biotechnol J ; 20: 6473-6489, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36467574

RESUMO

Flow cytometry has become a powerful technology for studying microbial community dynamics and ecology. These dynamics are tracked over long periods of time based on two-parameter community fingerprints consisting of subsets of cell distributions with similar cell properties. These subsets are highlighted by cytometric gates which are assembled into a gate template. Gate templates then are used to compare samples over time or between sites. The template is usually created manually by the operator which is time consuming, prone to human error and dependent on human expertise. Manual gating thus lacks reproducibility, which in turn might impact ecological downstream analyses such as various diversity parameters, turnover and nestedness or stability measures. We present a new version of our flowEMMi algorithm - originally designed for an automated construction of a gate template, which now (i) generates non-overlapping elliptical gates within a few minutes. Gate templates (ii) can be created for both single measurements and time-series measurements, allowing immediate downstream data analyses and on-line evaluation. Furthermore, it is possible to (iii) adjust gate sizes to Gaussian distribution confidence levels. This automatic approach (iv) makes the gate template creation objective and reproducible. Moreover, it can (v) generate hierarchies of gates. flowEMMi v2 is essential not only for exploratory studies, but also for routine monitoring and control of biotechnological processes. Therefore, flowEMMi v2 bridges a crucial bottleneck between automated cell sample collection and processing, and automated flow cytometric measurement on the one hand as well as automated downstream statistical analysis on the other hand.

2.
Evol Bioinform Online ; 15: 1176934319871919, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31496634

RESUMO

With the discovery of increasingly more functional noncoding RNAs (ncRNAs), it becomes eminent to more strongly consider them as important players during species evolution. Although tests for negative selection of ncRNAs already exist since the beginning of this century, the SSS-test is the first one for also investigating positive selection. When analyzing selection in ncRNAs, it should be taken into account that selection pressures can independently act on sequence and structure. We applied the SSS-test to explore the evolution of ncRNAs in primates and identified more than 100 long noncoding RNAs (lncRNAs) that might evolve under positive selection in humans. With this test, it is now possible to more thoroughly include ncRNAs into evolutionary studies.

3.
BMC Bioinformatics ; 20(1): 151, 2019 Mar 21.
Artigo em Inglês | MEDLINE | ID: mdl-30898084

RESUMO

BACKGROUND: Long non-coding RNAs (lncRNAs) play an important role in regulating gene expression and are thus important for determining phenotypes. Most attempts to measure selection in lncRNAs have focused on the primary sequence. The majority of small RNAs and at least some parts of lncRNAs must fold into specific structures to perform their biological function. Comprehensive assessments of selection acting on RNAs therefore must also encompass structure. Selection pressures acting on the structure of non-coding genes can be detected within multiple sequence alignments. Approaches of this type, however, have so far focused on negative selection. Thus, a computational method for identifying ncRNAs under positive selection is needed. RESULTS: We introduce the SSS-test (test for Selection on Secondary Structure) to identify positive selection and thus adaptive evolution. Benchmarks with biological as well as synthetic controls yield coherent signals for both negative and positive selection, demonstrating the functionality of the test. A survey of a lncRNA collection comprising 15,443 families resulted in 110 candidates that appear to be under positive selection in human. In 26 lncRNAs that have been associated with psychiatric disorders we identified local structures that have signs of positive selection in the human lineage. CONCLUSIONS: It is feasible to assay positive selection acting on RNA secondary structures on a genome-wide scale. The detection of human-specific positive selection in lncRNAs associated with cognitive disorder provides a set of candidate genes for further experimental testing and may provide insights into the evolution of cognitive abilities in humans. AVAILABILITY: The SSS-test and related software is available at: https://github.com/waltercostamb/SSS-test . The databases used in this work are available at: http://www.bioinf.uni-leipzig.de/Software/SSS-test/ .


Assuntos
Estrutura Secundária de Proteína/fisiologia , RNA/metabolismo , Humanos
4.
Algorithms Mol Biol ; 13: 15, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30258487

RESUMO

BACKGROUND: Genome sequences and genome annotation data have become available at ever increasing rates in response to the rapid progress in sequencing technologies. As a consequence the demand for methods supporting comparative, evolutionary analysis is also growing. In particular, efficient tools to visualize-omics data simultaneously for multiple species are sorely lacking. A first and crucial step in this direction is the construction of a common coordinate system. Since genomes not only differ by rearrangements but also by large insertions, deletions, and duplications, the use of a single reference genome is insufficient, in particular when the number of species becomes large. RESULTS: The computational problem then becomes to determine an order and orientations of optimal local alignments that are as co-linear as possible with all the genome sequences. We first review the most prominent approaches to model the problem formally and then proceed to showing that it can be phrased as a particular variant of the Betweenness Problem. It is NP hard in general. As exact solutions are beyond reach for the problem sizes of practical interest, we introduce a collection of heuristic simplifiers to resolve ordering conflicts. CONCLUSION: Benchmarks on real-life data ranging from bacterial to fly genomes demonstrate the feasibility of computing good common coordinate systems.

5.
Bioinformatics ; 34(15): 2676-2678, 2018 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-29554223

RESUMO

Summary: A standard method for the identification of novel RNAs or proteins is homology search via probabilistic models. One approach relies on the definition of families, which can be encoded as covariance models (CMs) or Hidden Markov Models (HMMs). While being powerful tools, their complexity makes it tedious to investigate them in their (default) tabulated form. This specifically applies to the interpretation of comparisons between multiple models as in family clans. The Covariance model visualization tools (CMV) visualize CMs or HMMs to: I) Obtain an easily interpretable representation of HMMs and CMs; II) Put them in context with the structural sequence alignments they have been created from; III) Investigate results of model comparisons and highlight regions of interest. Availability and implementation: Source code (http://www.github.com/eggzilla/cmv), web-service (http://rna.informatik.uni-freiburg.de/CMVS). Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Conformação de Ácido Nucleico , Conformação Proteica , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos , Análise de Sequência de RNA/métodos , Software , RNA/química , RNA/metabolismo
6.
J Theor Biol ; 438: 143-150, 2018 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-29175608

RESUMO

The Human Accelerated Region 1 (HAR1) is the most rapidly evolving region in the human genome. It is part of two overlapping long non-coding RNAs, has a length of only 118 nucleotides and features 18 human specific changes compared to an ancestral sequence that is extremely well conserved across non-human primates. The human HAR1 forms a stable secondary structure that is strikingly different from the one in chimpanzee as well as other closely related species, again emphasizing its human-specific evolutionary history. This suggests that positive selection has acted to stabilize human-specific features in the ensemble of HAR1 secondary structures. To investigate the evolutionary history of the human HAR1 structure, we developed a computational model that evaluates the relative likelihood of evolutionary trajectories as a probabilistic version of a Hamiltonian path problem. The model predicts that the most likely last step in turning the ancestral primate HAR1 into the human HAR1 was exactly the substitution that distinguishes the modern human HAR1 sequence from that of Denisovan, an archaic human, providing independent support for our model. The MutationOrder software is available for download and can be applied to other instances of RNA structure evolution.


Assuntos
Evolução Molecular , RNA não Traduzido/química , RNA não Traduzido/genética , Humanos , Modelos Biológicos , Mutação/genética , Conformação de Ácido Nucleico , Probabilidade , Fatores de Tempo
7.
J Math Biol ; 77(2): 313-341, 2018 08.
Artigo em Inglês | MEDLINE | ID: mdl-29260295

RESUMO

Clusters of paralogous genes such as the famous HOX cluster of developmental transcription factors tend to evolve by stepwise duplication of its members, often involving unequal crossing over. Gene conversion and possibly other mechanisms of concerted evolution further obfuscate the phylogenetic relationships. As a consequence, it is very difficult or even impossible to disentangle the detailed history of gene duplications in gene clusters. In this contribution we show that the expansion of gene clusters by unequal crossing over as proposed by Walter Gehring leads to distinctive patterns of genetic distances, namely a subclass of circular split systems. Furthermore, when the gene cluster was left undisturbed by genome rearrangements, the shortest Hamiltonian paths with respect to genetic distances coincide with the genomic order. This observation can be used to detect ancient genomic rearrangements of gene clusters and to distinguish gene clusters whose evolution was dominated by unequal crossing over within genes from those that expanded through other mechanisms.


Assuntos
Modelos Genéticos , Família Multigênica , Álcool Desidrogenase/genética , Algoritmos , Animais , Simulação por Computador , Troca Genética , Evolução Molecular , Duplicação Gênica , Genes Homeobox , Genoma , Humanos , Conceitos Matemáticos , Filogenia , Recombinação Genética
8.
Algorithms Mol Biol ; 12: 22, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28852417

RESUMO

BACKGROUND: Genomic DNA frequently undergoes rearrangement of the gene order that can be localized by comparing the two DNA sequences. In mitochondrial genomes different mechanisms are likely at work, at least some of which involve the duplication of sequence around the location of the apparent breakpoints. We hypothesize that these different mechanisms of genome rearrangement leave distinctive sequence footprints. In order to study such effects it is important to locate the breakpoint positions with precision. RESULTS: We define a partially local sequence alignment problem that assumes that following a rearrangement of a sequence F, two fragments L, and R are produced that may exactly fit together to match F, leave a gap of deleted DNA between L and R, or overlap with each other. We show that this alignment problem can be solved by dynamic programming in cubic space and time. We apply the new method to evaluate rearrangements of animal mitogenomes and find that a surprisingly large fraction of these events involved local sequence duplications. CONCLUSIONS: The partially local sequence alignment method is an effective way to investigate the mechanism of genomic rearrangement events. While applied here only to mitogenomes there is no reason why the method could not be used to also consider rearrangements in nuclear genomes.

9.
Cell Mol Life Sci ; 74(4): 747-760, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27677491

RESUMO

We have analyzed the binding of the liver-specific microRNA-122 (miR-122) to three conserved target sites of hepatitis C virus (HCV) RNA, two in the non-structural protein 5B (NS5B) coding region and one in the 3' untranslated region (3'UTR). miR-122 binding efficiency strongly depends on target site accessibility under conditions when the range of flanking sequences available for the formation of local RNA secondary structures changes. Our results indicate that the particular sequence feature that contributes most to the correlation between target site accessibility and binding strength varies between different target sites. This suggests that the dynamics of miRNA/Ago2 binding not only depends on the target site itself but also on flanking sequence context to a considerable extent, in particular in a small viral genome in which strong selection constraints act on coding sequence and overlapping cis-signals and model the accessibility of cis-signals. In full-length genomes, single and combination mutations in the miR-122 target sites reveal that site 5B.2 is positively involved in regulating overall genome replication efficiency, whereas mutation of site 5B.3 showed a weaker effect. Mutation of the 3'UTR site and double or triple mutants showed no significant overall effect on genome replication, whereas in a translation reporter RNA, the 3'UTR target site inhibits translation directed by the HCV 5'UTR. Thus, the miR-122 target sites in the 3'-region of the HCV genome are involved in a complex interplay in regulating different steps of the HCV replication cycle.


Assuntos
Regulação Viral da Expressão Gênica , Hepacivirus/fisiologia , Hepatite C/metabolismo , MicroRNAs/metabolismo , RNA Viral/genética , Proteínas não Estruturais Virais/genética , Replicação Viral , Regiões 3' não Traduzidas , Regiões 5' não Traduzidas , Proteínas Argonautas/metabolismo , Sequência de Bases , Genoma Viral , Células HeLa , Hepacivirus/química , Hepacivirus/genética , Hepatite C/genética , Hepatite C/virologia , Humanos , Conformação de Ácido Nucleico , Biossíntese de Proteínas , RNA Viral/química , RNA Viral/metabolismo
10.
Mol Phylogenet Evol ; 106: 209-216, 2017 01.
Artigo em Inglês | MEDLINE | ID: mdl-27693569

RESUMO

Mitochondrial genome sequences are available in large number and new sequences become published nowadays with increasing pace. Fast, automatic, consistent, and high quality annotations are a prerequisite for downstream analyses. Therefore, we present an automated pipeline for fast de novo annotation of mitochondrial protein-coding genes. The annotation is based on enhanced phylogeny-aware hidden Markov models (HMMs). The pipeline builds taxon-specific enhanced multiple sequence alignments (MSA) of already annotated sequences and corresponding HMMs using an approximation of the phylogeny. The MSAs are enhanced by fixing unannotated frameshifts, purging of wrong sequences, and removal of non-conserved columns from both ends. A comparison with reference annotations highlights the high quality of the results. The frameshift correction method predicts a large number of frameshifts, many of which are unknown. A detailed analysis of the frameshifts in nad3 of the Archosauria-Testudines group has been conducted.


Assuntos
Genoma Mitocondrial , Animais , Sequência de Bases , Aves/classificação , DNA Mitocondrial/química , DNA Mitocondrial/classificação , DNA Mitocondrial/metabolismo , Bases de Dados Genéticas , Mutação da Fase de Leitura , Cadeias de Markov , Proteínas Mitocondriais/química , Proteínas Mitocondriais/classificação , Proteínas Mitocondriais/genética , Proteínas Mitocondriais/metabolismo , Dados de Sequência Molecular , Filogenia , Répteis/classificação , Alinhamento de Sequência
11.
Nucleic Acids Res ; 44(17): 8433-41, 2016 09 30.
Artigo em Inglês | MEDLINE | ID: mdl-27330139

RESUMO

Determining the function of a non-coding RNA requires costly and time-consuming wet-lab experiments. For this reason, computational methods which ascertain the homology of a sequence and thereby deduce functionality and family membership are often exploited. In this fashion, newly sequenced genomes can be annotated in a completely computational way. Covariance models are commonly used to assign novel RNA sequences to a known RNA family. However, to construct such models several examples of the family have to be already known. Moreover, model building is the work of experts who manually edit the necessary RNA alignment and consensus structure. Our method, RNAlien, starting from a single input sequence collects potential family member sequences by multiple iterations of homology search. RNA family models are fully automatically constructed for the found sequences. We have tested our method on a subset of the Rfam RNA family database. RNAlien models are a starting point to construct models of comparable sensitivity and specificity to manually curated ones from the Rfam database. RNAlien Tool and web server are available at http://rna.tbi.univie.ac.at/rnalien/.


Assuntos
Algoritmos , Modelos Moleculares , RNA/química , Pareamento de Bases , Humanos , Conformação de Ácido Nucleico , Homologia de Sequência do Ácido Nucleico , Fatores de Tempo
12.
Artigo em Inglês | MEDLINE | ID: mdl-26357262

RESUMO

We develop a theory of algebraic operations over linear and context-free grammars that makes it possible to combine simple "atomic" grammars operating on single sequences into complex, multi-dimensional grammars. We demonstrate the utility of this framework by constructing the search spaces of complex alignment problems on multiple input sequences explicitly as algebraic expressions of very simple one-dimensional grammars. In particular, we provide a fully worked frameshift-aware, semiglobal DNA-protein alignment algorithm whose grammar is composed of products of small, atomic grammars. The compiler accompanying our theory makes it easy to experiment with the combination of multiple grammars and different operations. Composite grammars can be written out in L(A)T(E)X for documentation and as a guide to implementation of dynamic programming algorithms. An embedding in Haskell as a domain-specific language makes the theory directly accessible to writing and using grammar products without the detour of an external compiler. Software and supplemental files available here: http://www.bioinf. uni-leipzig.de/Software/gramprod/.


Assuntos
Algoritmos , Biologia Computacional/métodos , Dobramento de Proteína , Alinhamento de Sequência/métodos , Sequência de Aminoácidos , Sequência de Bases , Humanos , Semântica
13.
Nucleic Acids Res ; 43(16): 8044-56, 2015 Sep 18.
Artigo em Inglês | MEDLINE | ID: mdl-26227972

RESUMO

Remolding of tRNAs is a well-documented process in mitochondrial genomes that changes the identity of a tRNA. It involves a duplication of a tRNA gene, a mutation that changes the anticodon and the loss of the ancestral tRNA gene. The net effect is a functional tRNA that is more closely related to tRNAs of a different alloacceptor family than to tRNAs with the same anticodon in related species. Beyond being of interest for understanding mitochondrial tRNA function and evolution, tRNA remolding events can lead to artifacts in the annotation of mitogenomes and thus in studies of mitogenomic evolution. Therefore, it is important to identify and catalog these events. Here we describe novel methods to detect tRNA remolding in large-scale data sets and apply them to survey tRNA remolding throughout animal evolution. We identify several novel remolding events in addition to the ones previously mentioned in the literature. A detailed analysis of these remoldings showed that many of them are derived from ancestral events.


Assuntos
Evolução Molecular , Genoma Mitocondrial , RNA de Transferência/genética , Animais , Anticódon , Códon , Crustáceos/genética , Mutação , Poríferos/genética , RNA de Transferência de Leucina/genética , Alinhamento de Sequência
14.
RNA ; 21(6): 1110-21, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-25904133

RESUMO

A 3D model of RNA structure can provide information about its function and regulation that is not possible with just the sequence or secondary structure. Current models suffer from low accuracy and long running times and either neglect or presume knowledge of the long-range interactions which stabilize the tertiary structure. Our coarse-grained, helix-based, tertiary structure model operates with only a few degrees of freedom compared with all-atom models while preserving the ability to sample tertiary structures given a secondary structure. It strikes a balance between the precision of an all-atom tertiary structure model and the simplicity and effectiveness of a secondary structure representation. It provides a simplified tool for exploring global arrangements of helices and loops within RNA structures. We provide an example of a novel energy function relying only on the positions of stems and loops. We show that coupling our model to this energy function produces predictions as good as or better than the current state of the art tools. We propose that given the wide range of conformational space that needs to be explored, a coarse-grain approach can explore more conformations in less iterations than an all-atom model coupled to a fine-grain energy function. Finally, we emphasize the overarching theme of providing an ensemble of predicted structures, something which our tool excels at, rather than providing a handful of the lowest energy structures.


Assuntos
Biologia Computacional/métodos , RNA/química , Algoritmos , Ligação de Hidrogênio , Modelos Moleculares , Conformação de Ácido Nucleico
15.
Artigo em Inglês | MEDLINE | ID: mdl-24334379

RESUMO

G-quadruplexes are abundant locally stable structural elements in nucleic acids. The combinatorial theory of RNA structures and the dynamic programming algorithms for RNA secondary structure prediction are extended here to incorporate G-quadruplexes using a simple but plausible energy model. With preliminary energy parameters, we find that the overwhelming majority of putative quadruplex-forming sequences in the human genome are likely to fold into canonical secondary structures instead. Stable G-quadruplexes are strongly enriched, however, in the 5'UTR of protein coding mRNAs.


Assuntos
Quadruplex G , Conformação de Ácido Nucleico , RNA Mensageiro/química , Regiões 5' não Traduzidas , Sequência de Bases , Biologia Computacional , Humanos , Dobramento de RNA , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Alinhamento de Sequência , Análise de Sequência de RNA , Termodinâmica
16.
Nucleic Acids Res ; 41(22): 9999-10009, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-24005040

RESUMO

Recent progress in predicting RNA structure is moving towards filling the 'gap' in 2D RNA structure prediction where, for example, predicted internal loops often form non-canonical base pairs. This is increasingly recognized with the steady increase of known RNA 3D modules. There is a general interest in matching structural modules known from one molecule to other molecules for which the 3D structure is not known yet. We have created a pipeline, metaRNAmodules, which completely automates extracting putative modules from the FR3D database and mapping of such modules to Rfam alignments to obtain comparative evidence. Subsequently, the modules, initially represented by a graph, are turned into models for the RMDetect program, which allows to test their discriminative power using real and randomized Rfam alignments. An initial extraction of 22 495 3D modules in all PDB files results in 977 internal loop and 17 hairpin modules with clear discriminatory power. Many of these modules describe only minor variants of each other. Indeed, mapping of the modules onto Rfam families results in 35 unique locations in 11 different families. The metaRNAmodules pipeline source for the internal loop modules is available at http://rth.dk/resources/mrm.


Assuntos
RNA/química , Sequência de Bases , Modelos Moleculares , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Alinhamento de Sequência , Software
17.
Biopolymers ; 99(12): 1124-36, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-23818234

RESUMO

RNA has become an integral building material in synthetic biology. Dominated by their secondary structures, which can be computed efficiently, RNA molecules are amenable not only to in vitro and in vivo selection, but also to rational, computation-based design. While the inverse folding problem of constructing an RNA sequence with a prescribed ground-state structure has received considerable attention for nearly two decades, there have been few efforts to design RNAs that can switch between distinct prescribed conformations. We introduce a user-friendly tool for designing RNA sequences that fold into multiple target structures. The underlying algorithm makes use of a combination of graph coloring and heuristic local optimization to find sequences whose energy landscapes are dominated by the prescribed conformations. A flexible interface allows the specification of a wide range of design goals. We demonstrate that bi- and tri-stable "switches" can be designed easily with moderate computational effort for the vast majority of compatible combinations of desired target structures. RNAdesign is freely available under the GPL-v3 license.


Assuntos
Sequência de Bases , Conformação de Ácido Nucleico , Algoritmos , RNA/química , Dobramento de RNA
18.
Nucleic Acids Res ; 41(Web Server issue): W499-503, 2013 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-23640335

RESUMO

A standard method for the identification of novel non-coding RNAs is homology search by covariance models. Covariance models are constructed for specific RNA families with common sequence and structure (e.g. transfer RNAs). Currently, there are models for 2208 families available from Rfam. Before being included into a database, a proposed family should be tested for specificity (finding only true homolog sequences), sensitivity (finding remote homologs) and uniqueness. The CMCompare webserver (CMCws) compares Infernal RNA family models to (i) identify models with poor specificity and (ii) explore the relationship between models. The CMCws provides options to compare new models against all existing models in the current Rfam database to avoid the construction of duplicate models for the same non-coding RNA family. In addition, the user can explore the relationship between two or more models, including whole sets of user-created family models. Visualization of family relationships provides help in evaluating candidates for clusters of biologically related families, called clans. The CMCws is freely available, without any login requirements, at http://rna.tbi.univie.ac.at/cmcws, and the underlying software is available under the GPL-3 license.


Assuntos
Modelos Estatísticos , RNA não Traduzido/química , Homologia de Sequência do Ácido Nucleico , Software , Internet , RNA de Transferência/química , RNA de Transferência/classificação , RNA não Traduzido/classificação
19.
Mol Phylogenet Evol ; 64(1): 198-203, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22724136

RESUMO

Myzostomida comprise a group of marine worms associated mainly with echinoderms since the Carboniferous. Due to their unusual morphology the phylogenetic position in relation to other Lophotrochozoa is discussed since their description. According to different morphological and molecular markers the Myzostomida are either close to Platyzoa or Annelida. Here we investigated small non-coding RNAs of Myzostoma cirriferum to infer the phylogenetic position of myzostomids. Based on transcriptomic data collected by Illumina Deep Sequencing we analyzed the microRNA (miRNA) families occurring in M. cirriferum. Phylogenetic analysis revealed the presence of 13 miRNA-families exclusively shared by Annelida (including Sipuncula) and Myzostomida, as such highly significantly supporting an annelid origin of myzostomids. Furthermore, using a mapping-approach and secondary structure models we predicted several miRNA-candidates unique for myzostomids.


Assuntos
Anelídeos/classificação , Anelídeos/genética , Filogenia , Animais , Primers do DNA/genética , França , Sequenciamento de Nucleotídeos em Larga Escala/métodos , MicroRNAs/genética , Especificidade da Espécie
20.
Algorithms Mol Biol ; 6: 26, 2011 Nov 24.
Artigo em Inglês | MEDLINE | ID: mdl-22115189

RESUMO

BACKGROUND: Secondary structure forms an important intermediate level of description of nucleic acids that encapsulates the dominating part of the folding energy, is often well conserved in evolution, and is routinely used as a basis to explain experimental findings. Based on carefully measured thermodynamic parameters, exact dynamic programming algorithms can be used to compute ground states, base pairing probabilities, as well as thermodynamic properties. RESULTS: The ViennaRNA Package has been a widely used compilation of RNA secondary structure related computer programs for nearly two decades. Major changes in the structure of the standard energy model, the Turner 2004 parameters, the pervasive use of multi-core CPUs, and an increasing number of algorithmic variants prompted a major technical overhaul of both the underlying RNAlib and the interactive user programs. New features include an expanded repertoire of tools to assess RNA-RNA interactions and restricted ensembles of structures, additional output information such as centroid structures and maximum expected accuracy structures derived from base pairing probabilities, or z-scores for locally stable secondary structures, and support for input in fasta format. Updates were implemented without compromising the computational efficiency of the core algorithms and ensuring compatibility with earlier versions. CONCLUSIONS: The ViennaRNA Package 2.0, supporting concurrent computations via OpenMP, can be downloaded from http://www.tbi.univie.ac.at/RNA.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...